Japanese Dependency Structure Analysis based on Lexicalized Statistics

Authors

  • Masakazu Fujio
  • Yuji Matsumoto
Abstract

We present statistical models of Japanese dependency analysis and report results of some experiments to investigate the performance of the models for the use of a practical parsing system. The statistical models are rather simple compared with the recent complex models and intensively use lexical level information, such as morphemes and part-of-speech tags. We conducted several experiments to show the following properties of the models: performance of the models according to feature selection; performance of the models as a partial parsing sys-

Similar resources

Probabilistic Coordination Disambiguation in a Fully-Lexicalized Japanese Parser

This paper describes a probabilistic model for coordination disambiguation integrated into syntactic and case structure analysis. Our model probabilistically assesses the parallelism of a candidate coordinate structure using syntactic/semantic similarities and cooccurrence statistics. We integrate these probabilities into the framework of fully-lexicalized parsing based on large-scale case frame...

Bootstrapping Lexicalized Models in Memory-Based Dependency Parsing

Previous research has shown that a lexicalized parsing model incorporating words but no parts-of-speech can outperform a model involving parts-of-speech but no words given enough training data for supervised learning. We show that the same effect can be achieved with a bootstrapping approach, where a mixed model trained on a small treebank is used to parse a larger corpus which is used as traini...

Stress-Strength and Ageing Intensity Analysis via a New Bivariate Negative Gompertz-Makeham Model

In demography and in modelling mortality (or failure) data, the univariate Makeham-Gompertz distribution is well known as an extension of the exponential distribution. Here, a bivariate class of Gompertz-Makeham distributions is constructed based on a random number of extremal events. Some reliability properties, such as ageing intensity and stress-strength based on competing risks, are given. Also dependence properties...

Coordination Disambiguation without Any Similarities

The use of similarities has been one of the main approaches to resolve the ambiguities of coordinate structures. In this paper, we present an alternative method for coordination disambiguation, which does not use similarities. Our hypothesis is that coordinate structures are supported by surrounding dependency relations, and that such dependency relations rather yield similarity between conjunc...

A Fully-Lexicalized Probabilistic Model for Japanese Syntactic and Case Structure Analysis

We present an integrated probabilistic model for Japanese syntactic and case structure analysis. Syntactic and case structure are simultaneously analyzed based on wide-coverage case frames that are constructed from a huge raw corpus in an unsupervised manner. This model selects the syntactic and case structure that has the highest generative probability. We evaluate both syntactic structure and...

Publication date: 1998